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Abstract 



A simple expression is derived for the terms in the Baker-Campbell- 
Hausdorff series. One formulation of the result involves a finite number of 
operations with matrices of rational numbers. Generalizations are discussed. 
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I. INTRODUCTION 



The Baker-Campbell-Hausdorff series has a long history and has applications in a wide 
variety of problems, as explained in Refs. p~3[| . In a classic paper, Goldberg |l| was able to 



derive an integral expression for the coefficients in the general term, and this result is still 
used today to calculate the Baker-Campbell-Hausdorff series. 

In this paper, we present a simple method for calculating the terms in the Baker- 
Campbell-Hausdorff series. The process can be carried out by hand, and it is easily im- 
plemented on a computer. 



II. STATEMENT OF THEOREM 

We let z = log(e x e y ) denote the Baker-Campbell-Hausdorff series for noncommuting 
variables x and y. Our result for the n-th order term z n in this series is given by the 
following procedure, which involves only a finite number of matrix multiplications. We state 
our results without reference to commutators. If an expression in terms of commutators is 
desired, our expression can be transformed using the substitution due to Dynkin |||5|-[7|] . 
Each product involving x and y variables is replaced by - times the corresponding iterated 
commutator of the same sequence of x's and y's. The quantity z n is invariant under this 
transformation. 

To calculate z n , the entire n-th order term in the Baker-Campbell-Hausdorff series, we 
compute a certain polynomial in n (ordinary commuting) variables, ai,...,a n , and then 
make a replacement, as described below. We begin by defining two (n + 1) x (n + 1) matrices 
F and G by 



I 



u-iy 



(2.1) 



and 



G, 



3-1 

n 

k=i 



(2.2) 



These equations are valid for all % and j from 1 to n + 1, with the usual convention that the 
reciprocal of the factorial of a negative integer is zero. Written out explicitly, the matrices 
are 
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2 6 

1 1 I 



\ 



(2.3) 



1 7 



and 



2 



(\ 



G 



1 <7 2 
1 



\ 



0"3 



Or, 
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(2.4) 



1 / 



Although it is not necessary for the calculation of results, we point out at this point that the 
matrices F and G are exponentials of very simple matrices. We define two (n + 1) x (n + 1) 
matrices M and iV by 



H+l,j 



and 



(2.5) 



(2.6) 



These equations are valid for % and j ranging from 1 to n + 1. A simple application of 
the definition of the exponential function gives F = exp M and G = exp N. Written out 
explicitly, these statements are 



/0 1 
1 



F = exp 



\ 



1 
0/ 



(2.7) 



and 



/0 ffl 

a 2 



G = exp 



V 



(2.8) 



/ 



The matrices M and N will be used later in a proof. 

Our expression for the n-th order term in the Baker-Campbell-Hausdorff series is 

^ = T(logFG)i, n+ i. 



(2.9) 



The indices on the right-hand side of this equation indicate the upper-right element of the 
matrix logFG. The operator T replaces products of a- variables with products of x and y 
according to the following procedure. The polynomial (logFG)i jn+ i is a sum of terms, each 
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of which may be written as a rational number times <rf 1 cr^ 2 • • • cr^ n , where the /ij are either 
or 1 (no exponents greater than 1 occur, as explained later in this paper). Next, erf 1 is 
replaced with x if ^ = and y if //j = 1. Thus each a, that occurs (to the first power) 
in a term indicates that a y is to be placed at the z-th location in the product of x and y 
variables. For example, in the case n = 6, we have T(o"2 04 05) = xyxyyx. The operator T is 
a vector-space isomorphism from the space of polynomials in the a- variables (with /i« < 1) 
to the space of linear combinations of products that have n factors that are either x or y. 

The log operation in Eq. ( |2.9| ) is simple because FG is equal to the (n + 1) x (n + 1) 
identity matrix (which we denote by /) plus a matrix that is strictly upper triangular. Thus 
the series for log[7 + (FG — I)] terminates after finitely many terms. 



The calculation of z nj the order n term in the Baker-Campbell-Hausdorff series, can therefore 
be carried out with a finite number of simple operations. There are no sums over partitions, 
operations with noncommuting variables, translations of binary sequences into descriptions 
in terms of block lengths, etc. 




9=1 



(2.10) 



III. EXAMPLES 



Let us begin by working out the example of n = 1. We have 




(3.1) 



and 




(3.2) 



From this follows 




(3.3) 



and 



zi = T (log FG) 1,1+1 



(3.4) 
(3.5) 



= x + y . 



Next let us work out the example of n 



2 



We have 




(3.6) 



and 
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G= \ 1 n, J . (3.7) 

From this follows 

FG= ( 1 l+aa | (3.8) 

and 

(0 1 + CTi + 02 + CTl 0"2 

| , (3.9) 


so that 

z 2 = T (log FG\ 2+1 = T (lo°al - (3.10) 

= ^(xy-yx). (3.11) 
For the case n — 3, the equations result in 

Z3 = II G\ CT 2 H 0"3 H CTi Go CTl H Oo 0"3 (3.12) 

\12 6 12 12 6 12 ) v ' 

111111 , 

= j^yxx - -xyx + —xxy + j^yyx - -yxy + —xyy . (3.13) 

The case n = 4 works out to be 

( 1 i i i A , 

Z4 = T I -—0i cr 2 + —fxi ^3 - Y2 a2(7i+ 24 a3<74 J ^ 3 ' 14 ^ 

= ——yyxx + —yxyx — —xyxy + —xxyy . (3.15) 

These results and higher-order calculations not shown here agree with results published 
in the literature ||,f|] . As an example of the types of coefficients that occur, when n is 7 
our formula gives a coefficient of —1/1512 for the yxxxyyy term, and this agrees with the 
literature result. 



IV. PROOF OF THEOREM 

We begin by considering the Baker-Campbell-Hausdorff series for log [e M e N ) , where the 
(n + 1) x (n + 1) matrices M and iV are defined in Eqs. (|2.5|) and ( |2.6|) . The matrices M 
and iV are written out explicitly on the right-hand sides of Eqs. ( |2.7| ) and ( J2.8[ ). They have 
nonzero elements only on the first superdiagonal. Therefore, a product having m factors 
that are either M or N will have nonzero elements only on the m-th superdiagonal. Thus, 
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the upper-right element of the matrix log (e M e N j is equal to the upper-right element of the 
matrix that is the order n term in the Baker- Campbell-Hausdorff series for log (e M e N ) . We 
write this as 

N (e M e N )} l = C (W) [n(W)] lin+1 , (4.1) 

w 

where the sum runs over all "words" W of length n (ordered n-tuples of elements that are 
either the symbol M or the symbol N), C(W) denotes the coefficient of W in the order n 
term in the Baker- Campbell-Hausdorff series, and II(W) denotes a product of M and iV 
matrices as specified by the word W. 

We now show that [II(W)]i )n _)_i is a product of u- variables whose indices give the positions 
of the iV's in the word W. We let the matrix I1(W / ) act on an (n + l)-component column 
vector that is all zeroes except the lowest element, which is a 1. After each multiplication 
by an M or an N matrix, the location of the nonzero element in the column vector moves 
up by one step. If the matrix multiplying the column vector is an N, the nonzero element 
in the column vector gets multiplied by a a. The index on the a gives the location of the N 
matrix in the word W, as can be seen by looking at the structure of the N matrix, shown on 
the right-hand side of Eq. fl2.8|) . After all of the n matrices in the word W have acted on the 
column vector, the nonzero element in the vector is at the top, and this element is a product 
of a- variables whose indices describe the word W in the manner explained above. The top 
element of the vector obtained by letting a matrix act on the initial column vector described 
above is the upper-right element of the matrix. Thus we have shown that [n(W)]i (n +i is a 
product of a-variables whose indices give the positions of the iV's in the word W. This fact 
together with the relations F = expM and G = expiV and Eq. ( [4.1| ) proves Eq. ( |2.9| ). 



V. ALTERNATIVE FORMULATION 



In this section we present a result equivalent to the result presented above, but the 
matrix operations involve only numbers. We will express the n-th order term, linear 
combination of terms of the form (x + a\ y) (x + 02 y) • • • (x + a n y), where the ex's are either 
+ 1 or —1. There are 2 n such terms. Our result is that the coefficient of a given term is 
2~ n times the value of the polynomial (logFG) lin+1 defined Sec. 0, with the corresponding 
a- values substituted in. This number can be computed by substituting the cx-values into the 
G matrix before doing the matrix operations. These statements can be summarized as 



2 n (log FG) hn+1 (x + a iy )(x + a 2 y) ■■■ (x + a n y) 



where the sum is over all 2 n possible assignments of ±1 to the cx-variables. 
Let us work out an example. For n = 2 the equation becomes 

Y ~ Y 



Z2 



1-2 



E 



(x + a 1 y)(x + a 2 y) 



(5.1) 



(5.2) 



1 



~{xy-yx) 



-y - \ ) (•'■+//)(•'■ - '/) 



(5.3) 
(5.4) 
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where we have used Eq. ( |3.10| ). 

A calculation of z n without the use of multiplication of matrices of polynomials would 
proceed in the following way. For every choice of +1 or —1 values for the a- variables 
one computes the value of 2~ n (log FG) i <n+ i. This involves a finite number of operations 
with numbers. The result is the coefficient of (x + o\ y) (x + 02 y) ■ ■ ■ (x + o n y) in the 
expression for z n in Eq. (|5.1|) . Next, we imagine the process of expanding all of the products 
(x + <J\ y) (x + 02 y) ■ • ■ (x + o n y). The result of this operation is a sum over words in the 
variables x and y. To get the coefficient of a particular word one sums all of the coefficients 
calculated from 2~ n (logFG) l n+1 with a sign given by the product of the 0's at the locations 
of the y's in the word. For example, if the word is xxyyxy then the coefficients are summed 
with signs given by 03 04 0%. 

We now prove Eq. ( |5.1| ). We consider the order n term in log(e x+y e x ~ y ) . This may be 
written in two ways, 

C(a 1 ,...,a n )(x + a 1 y)---(x + a n y) = Y,C'(W)W. (5.5) 

<Ti,...,a n W 

The sum on the left-hand side is a sum over all assignments of +1 or —1 to the variables 
0i,..., a n . The coefficient C(ai, . . . , cr n ) is the usual coefficient in the Baker-Campbell- 
Hausdorff series, with the 0's identifying a word. The right-hand side of Eq. ( |5.5| ) results 
from multiplying out all of the products (x + a% y) • • • (x + a n y) . It is a sum over words in 
x and y, and C'(W) denotes the resulting coefficient of the word W . [In this context we 
use the term "word" to denote an actual product of a certain sequence of x and y variables, 
because such a product does not evaluate to become something else, as it did in the case of 
the products of M and N matrices in the previous section. Thus, the notation I1(W / ) is not 
needed in Eq. ( [5.5D -1 The coefficient C'(W) of a particular word W can be expressed in terms 
of the C(ai, . . . , n ). For every set of values for a±, . . . , a n in the sum on the left-hand side 
we get a contribution to C'(W) of C(ai, . . . , a n ) times the product of the 0's that correspond 
to the y's in W. This product is the same as the product of <j[ W \ where the 0^ values 
describe the word W (cr\ W ^ is +1 if the i-th factor in W is x, and is -1 if the i-th factor 
in W is y), and the product runs over i- values corresponding to negative 0's. Thus, C'(W) 
is precisely the polynomial in <j( w > given in the first theorem. Now we transform from x 
and y to new variables according to x + y = x and x — y = y. This implies x = (x + y)/2 
and y = (x — y)/2, and the right-hand side of Eq. (|5.5| ) (which equals loge x e y ) becomes the 
right-hand side of Eq. (|5~T|), after the tildes have been dropped, which is justified since only 
variables with tildes occur at that point. The n factors of 1/2 are collected into the factor 
of 2- n in Eq. Q. 

VI. SYMMETRIES OF THE COEFFICIENTS 

The calculation described in the penultimate paragraph of the preceding section involves 
repeated evaluation of (logFG)i jn+ i with values of +1 and —1 substituted in for the 0- 
variables. These numbers are the coefficients in the sum in Eq. (|5.1| ). We begin this section 
by showing that half of the resulting numbers will be zero because of a basic symmetry of the 
Baker-Campbell-Hausdorff series. Then we show that some of the nonvanishing coefficients 
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can be obtained from other ones. These considerations reduce the computational work 
involved in calculating the coefficients in Eq. fl5.1|) . 

The relationship e z = e x e y implies e~ z = e~ y e~ x and 

z = -\oge- y e~ x . (6.1) 

From this we see that swapping the x's and t/'s in the n-th order term z n gives the same 
result as multiplying z n by (— l) n_1 . In the present context, the o-variables are being assigned 
values of +1 or —1, so we may use the relationship of = 1. The preceding statement about 
the symmetry of z n is equivalent to 

n 

(logFG) lin+1 ]Ja t = (logFGV+L (6.2) 

i=l 

Multiplication by Yl7=i a * effects a swapping of x and y, because of the relationship of = 1. 
If the number of o's that are +1 is even then Y]2=i Gi ec L ua l ( — l) n because all of the o's in 
the product may be replaced with —1 without changing the value of the product. Therefore, 
if the number of o's that are +1 is even, then (log -FG) l n+1 is zero. Elementary combinatorics 
shows that this condition holds for one-half of the terms in the sum in Eq. ( |5.1| ). In the case 
of odd n greater than 1, there is one additional term which vanishes, namely the one for 
which all of the o's are +1. This is because the matrices M and N are equal and therefore 
commute. Thus log FG is equal to M + N and the upper-right element of logFG is zero. 
[In the case of even n, (log FG) i, n+ i of course also vanishes when all of the o's are +1, but 
this vanishing has already been counted in the discussion above.] 

Thus far, we have identified coefficients in the sum in Eq. ( p.l| ) that are zero for symmetry 
reasons. This author has searched up through order n — 15 and found all of the remaining 
coefficients to be nonzero. 

A further symmetry of (log FG) i >n+ i is 

[(logFGV+iKo!, . . . , a n ) = (-l) n - 1 [(logFG) lin+1 ](a n , ...,a l ). (6.3) 

The notation on the left-hand side of this equation indicates explicitly that (logFG)i in+ i is a 
function of the n variables Oi, . . . , o n . On the right-hand side, these n quantities are inserted 
into the function in the reversed order. The fact that these two values of the function are 
related by a factor of (— l)™ -1 is due to a symmetry of the Baker-Campbell-Hausdorff series. 
It follows immediately from results in Ref. that the coefficient of a word in the variables 
x and y is equal to (— l) n_1 times the coefficient of the word obtained by reversing the order 
of the factors in the original word. This implies that T{[(logFG)i >n+ i](o n , . . . , Oi)} (where 
T is the operator defined in Sec. ||), which is the order n term in the Baker-Campbell- 
Hausdorff series with the sequence of the factors in each term reversed, is equal to (— l) n_1 
times T{[(logFG)i jn+ i](oi, . . . , o n )}. Because the vector-space isomorphism T is invertible, 
this proves Eq. (|6.3|) . This equation is useful because it can be used to avoid carrying out 
unnecessary evaluations of (logFG) l n+1 . 

VII. COMPUTER IMPLEMENTATION 

The methods presented in this paper can easily be used with computers. A simple 
example of how the results of Sec. |I| can be implemented using Mathematica is shown 



S 



below. It is not necessary to load any special packages to run this code. This example is 
oriented toward ease of coding. Faster implementations are possible. The first program 
gives the polynomial to the right of the T operator in Eq. ( |2.9| ), and the second program 
(for n > 1) translates this into z n , the corresponding expression in terms of x and y. 

p[n_] := p[n] = ( F = Table [1/ (j-i) !, {i ,n+l} , {j ,n+l}] ; 

G = Tabled/ (j-i)! Product [s [k] ,{k, i ,j-l}] , {i ,n+l} , {j ,n+l}] ; 
qthpower = IdentityMatrix [n+1] ; FGml = F.G - qthpower; Expand [ 
-Sum[qthpower=qthpower .FGml ; (-l)"q / q qthpower, {q, n}] [[1, n+1]]] ) 

translated [n_] := (temp = Expand [Product [s [k] "2 , {k,n}] p[n]]; 

Sum [term = Apply [List, temp[[i]]]; term[[l]] Apply [String Join, 
Take[term,-n] /. {s [i_] ~2->"x" , s [i_] ~3->"y"}] , { i, Length [t emp] }] ) 

In [3]:= translated [4] 

xxyy xyxy yxyx yyxx 

Out [3]= + 

24 12 12 24 



VIII. OTHER SERIES 

As in the case of Goldberg's results M, the methods of this paper can be used to calculate 
log f{x)f(y), where f(x) is an arbitrary power series with /(0) = 1. The only changes that 
are necessary are that occurrences of the exponential function such as those in Eqs. 



and ( |2.8| ) must be replaced with the function /. The matrices M and N have the property 
that when they are raised to the power n + 1 the result is zero, so the calculation of f{M) 
and f(N) terminates after finitely many matrix operations. 



IX. GENERALIZED BAKER-CAMPBELL-HAUSDORFF SERIES 

The methods presented in Sec. |1| may also be used to calculate the terms in generalized 
Baker-Campbell-Hausdorff series. For example, if z = log e x e y e w then the n-th order term 
may be found as follows. Matrices F and G are defined as in Eqs. Q2.1]) and (|2.2j ), and a 
matrix H is defined by 

1 

Hij = . . _ \\ r k , i,j = l,...,n + l, (9.1) 

w % )' k=i 

where T\, . . . , r n are n additional commuting variables. The definition of H is the same as 
the definition of G, except that different variables are used. Reasoning similar to that in the 
original case gives the following expression for z n . 

z n = T (log FGH) 1>n+1 , (9.2) 
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where the definition of the T operator now has been extended to also putting a w at the 
z-th position of a product of x's, y's and w's for an occurrence of r,. For example, when n is 
4, we have T (a 2 r z) — xywx. The results obtained from Eq. ( |9.2| ) agree with those obtained 
from Reutenauer's generalization || of Goldberg's theorem. 

An example of how these methods can be used with Mathematica is shown below. For 
ease of coding, the notation has been changed slightly. 

p3[n_] := p3[n] = ( 

F = Table [1/ (j-i) ! Product [s [k, "x"] , {k, i ,j-l>l , "Ci ,n+l}, {j ,n+l}] ; 
G = Table [l/(j-i) ! Product [s [k, "y"] , {k, i ,j-l>] , {i ,n+l}, {j ,n+l}] ; 
H = Table [l/(j-i) ! Product [s [k, "w"] , {k, i ,j-l}] , {i ,n+l}, {j ,n+l}] ; 
qthpower = IdentityMatrix [n+1] ; FGml = F.G.H - qthpower; Expand [ 
-Sum[qthpower=qthpower.FGml; (-l)"q / q qthpower, {q, n}] [[1, n+1]]] ) 

translated3 [n_] := (temp = p3[n]; Sum [term = Apply [List, temp[[i]]]; term[[l]]* 
Apply [StringJoin, Take [term, -n] /. s [j_,k_] ->k] , {i , Length [temp] }] ) 

In [3]:= translated3[2] 

-wx wy xw xy yw yx 

Out [3]= + — + — + 

2 2 2 2 2 2 

X. CONCLUSION 

The results contained in this paper provide a means of computing the entire n-th order 
term in the Baker-Campbell-Hausdorff series, without the use of noncommuting variables, 
sums over partitions, or other complicated operations. One application is in writing simple 
programs in standard computer languages to calculate the Baker-Campbell-Hausdorff series. 
Such programs do not result in a significant reduction in computer time needed. Rather, 
the programming involved is simplified. The sample program included in this paper can be 
shortened to just a few lines, and it is not necessary to load special software packages. The 
calculation of higher-order terms in the Baker-Campbell-Hausdorff series is usually done in 
computer languages that do not have symbol manipulation, because they are faster. This 
paper also explains how a simple program can be written in such a language to calculate 
the series. 

This paper does not address the question of expressing the Baker-Campbell-Hausdorff 
series in terms of commutators. As explained in Sec. [H], if an expression in terms of com- 
mutators is desired, the substitution due to Dynkin |H^-|7|] may be used to transform the 
results calculated here. 

Future research could include finding alternative ways to calculate certain sequences 
of elements in graded free Lie algebras, such as those in Ref. [[HJ (which also contains 
an interesting method of computing the Baker-Campbell-Hausdorff series by numerically 
integrating a differential equation). These quantities occur in the optimization of numerical 
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algorithms involving computations in Lie algebras. Graded Lie algebra bases can be used in 
the construction of Runge-Kutta methods on manifolds. 
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